ON THE MEASURE DIVISION CONSTRUCTION OF A COALESCENTS 
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Abstract. This paper provides a new construction of A coalescents called "measure division con- 
struction" . This construction consists in dividing the characteristic measure A into several parts 
and adding them one by one to have a whole process. Using this method, we investigate the time 
back to the MRCA (most recent common ancestor) of a Bolthausen-Sznitman coalescent perturbed 
by a signed measure. We also study the branch lengths of some other coalescent processes "around" 
Boltausen-Sznitman coalescent. 



1. Introduction 

1.1. Motivation and main results. Let N := {1, 2, • • ■ }, f2 be a subset of N and tt a partition of ft 
sucli that |7r| < +oo (|7r| denotes the mmiber of blocks in tt). Tlic A coalescent process starting from 
TT, introduced independently by Pit man p4] and Sagitov[25]. is denoted by n'^^''^^ := (n(^''^)(t))t>o, 
where n^^-'^' (0) = tt and A is a finite measure on [0, 1]. Here we specify that a finite measure on [0, 1] 
can be a null measure and hence its total mass is a non-negative real value. If tt = {{1}, {2}, • ■ • , {n}}, 
i.e. the set of first n singletons, then the process is simply denoted by n'^'^'") . For any finite measure 
A' on [0, 1], we call the A coalescent process with A = A' "A' coalescent". We assume that all the 
measures throughout this paper are supported in [0, 1]. 

This process is a continuous time Markov chain with cadlag trajectories taking values in 

the set of partitions of fl. More precisely: Assume that at time t, Il'•^''^^t) has b blocks, then after a 
random exponential time with parameter 



(1) I -'-'(^--r'Hd^)^ 

encounters a collision and the probability for a group of fc(2 < k < b) blocks to be merged with 
the other b — k blocks unchanged is 



x'^-^l-xf-'^Aidx) 

(A) ■ 

56 



Then 

.2^ (A) Qlo'x'^-'il-xr'Hdx) 

y^^) {'b,b-k+l (A) ' 

9 b 

is the probability to have b — k + 1 blocks after the collision. 

Remark that if A({0}) = 0, then we get the following well known formula: 

(3) gl^^ = / V - (1 - x)'' - bx{l - xf-^)x-^K{dx). 

Jo 

The definition shows that the law of 11^^.^) jg determined by the initial value tt and the measure A 
which is hence called characteristic measure. 



Date: February 6, 2013. 

2010 Mathematics Subject Classification. Primary: 60J25, 60F05. Secondary: 92D15, 60K37. 

Key words and phrases. A coalescent, measure division construction, branch length, time back to the most recent 
common ancestor, noise measure, main measure. 



2 



LINGLONG YUAN 



Notice that ft can be an abstract set and the coalescing mechanism works all the same. The reason 
why one takes as a subset of N relies on its applications in the genealogies of populations. We take 
n(^'") as an example where = {1,2,--- ,n}. At time 0, we have n(^^")(0) = {{1},{2},-- - ,{n}} 
which is interpreted as a sample of n individuals labelled from 1 to n . If at time t, n^^'"' has its 
first coalescence where {1} and {2} are merged together with the others unchanged, then n(^'")(t) = 
{{1, 2}, {3}, • • • , {n}} which is interpreted as getting the MRCA (most recent common ancestor) {1, 2} 
of individuals 1 and 2 with the others unchanged at that time, etc. Hence {1, 2, • • • , n} is an absorption 
state of n(^'") and is the MRCA of aU individuals. For more details, we refer to pUll^ or pi 151 [Tl [T5] . 

The definition shows that right before a collision, all blocks are exchangeable to be selected for the 
collision. This property is called exchangeability. 

Let 1 < m < 71 and a the restriction from {1, 2, • • • , n} to {1, 2, • • • , m}. We have the consistency 

property: a o II^^'") ='' n^^'™) (see [21]). According to this property, if tt' is a subset of tt, then 
the restriction of II'^^''^) from tt to tt' has the same distribution as that of n'^''^ \ Notice that the 
restriction is from path to path. We can define H^'^''^) when \tt\ = +cxd by using the consistency 
property and the definition in finite case (see [24j). 

Let |n'^'^'")| be the block counting process associated to n''^'"^ Then it decreases from n at time 
0. We denote by r(^'"'the time back to the MRCA of H^^'"^ and by X^^'"^ the decrease of number 
of blocks at the first coalescence. For i G {1, . . . , n}, we define 

i;^^'"' :=inf{t>0|O}^n(^^"'} 

the length of the ith external branch and T^^^") the length of a randomly chosen external branch. 

By exchangeability, t\^''^'^ = T^^'"'. We denote by i^^^f^ := Y.7=iT-^'"^ the total external branch 
length of n^^'"\ and by i'^'") the total branch length. 

There are four classes of A coalescents having been largely studied. We give some results concerning 
one external branch length T'^^'"^ It shows a common regularity that we will discuss later. 

• A = Sq: Kingman coalescent (see [20], [21]). Then nT*-^'"^ is asymptotically distributed with 
density function pq^l:r>o (See g], [7], [11]). 

• A = A'*^'': Bolthausen-Sznitman coalescent (see [S]). Here A'*^** means the Icbesgue measure 
on [0,1]. Then InnT^"^'") converges in distribution to Exp{l) (we denote by Exp{r),r > 0, 
the exponential variable with parameter r)|15j. 

• A{dx)/dx = ^ Beta{a]>) — -'-o<2;<iiO < fl < 1,6 > : Beta{a,b) coalescent. Here Beta{-,-) 
denotes Euler's beta function. Then n^^T*-^'"' converges in distribution to a random variable 
T{a,b) which has density function (X^^^yr^E) (-'^ + {2-ayr{b) ix>a (see [10]). 

• J^^ x~^A{dx) < +00: These processes are called coalescents without proper frequencies. 
This category contains Beta{a,b) coalescents with l<a<2,&>0 (see [23], [H]). Then 

converges in distribution to Exp{l) (see [17], [23] ). 

We see a common property for the last three cases concerning one external branch length which is 
that the normalization factor for t'^'"^ is ^(^'") := J^^^ x~^A{dx). More precisely, 

• Bolthausen-Sznitman coalescent: Notice that /i^'^'") =lnn. Hence directly we have ^''^'"'t'-^'"' 
Exp{l). 

• Beta{a, b) coalescent with < a <, 6 > 0: 

M<^-'= r x-A(dx)= r ff^.-^(i-.)''-^dx= "^^^ritrM ^'^+Q^^)- 

Ji/n A/« r(a)r(5) (1 -a)r(a)r(6) 

Hence ^(A,n)2^(A,«) converges in distribution to T{a, b)r{a + b)/{l - a)r(a)r(6). 
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..(A.n) 

• If L x^^A(dx) < +00, then lim — ; = 1. Hence /i(^'")T('^'"^ converges in distri- 

«^+-/o'x-iA(da;) 

bution to Exp{l). 

Kingman coalescent can be viewed as the informal hmit of Beta{a, b) coalescent with < a < 1, & > 
when a tends to 0, since the measure ^ Beta{a b} — ^o<x<i tends weakly to the Dirac measure on 0. 
The normalization factor in the case of Betaia^h) coalescent is and of Kingman coalescent is 

n. Hence we see that these two factors show also some kind of continuity as a tends to 0. We can 
informally take n as in the case of Kingman coalescent. 

Therefore characteristic for one external branch length in those processes considered. 

Notice that ^(^'") concerns only the measure Al[i/„ ^j, so it is natural to think about the influences 
of measures Al[i/„ and Al[o,i/„) on one external branch length. More generally, if A = Ai + A2, 
how can we evaluate each influence on the construction of the whole A coalescent? To this aim, we 
introduce in the next section the "measure division construction" of a A coalescent. The idea of this 
construction can be at least tracked back to [2] where the authors consider a coupling of two finite 
measure on [0, 1]. 

Let r be a signed measure on [0, 1] (signed measures include measures). According to Hahn decom- 
position theorem (see [3] for example), every signed measure F has a decomposition P and N which 
are two subsets of [0, 1] such that 

• P[]N ^ [0,1], and P[\N ^ 0. 

• VF C P, we have T{F) > 0. 

• VF C iV, we have T{F) < 0. 

We define by ^^(r)(resp. fjL'iT)) the measure such that for any F C [0, 1], we have ^'^{T){F) = 
r(Pn^) (resp. ti-{T){F) = -r(iVnP)). Both ^+(r) and //"(T) are measures. We denote by 
|r| := + /i^(r) the variation of F. Let Fi and F2 be two signed measures and we denote by 

Fi < F2, if F2 — Fi is a measure. Now we turn to the main results. 

Theorem 1.1. Let T he a signed measure and |F| he the variation ofT. We assume that — ln(l — 
x)x~'^\r\{dx) < +00 and A'"^^ + T is a measure. Then the A coalescent with A = A''^'' + F satisfies: 

(4) r(^'") -InhiJi G, 

where G is standard Gumhel distributed, i.e. with distribution function exp{—exp{—x)),x G M. 
Remark 1.1. We can apply this theorem to the Beta{l, b) coalescents with 6 > to get: 

i(r(^'")-lnlnn)^G. 


This closes an open problem posted by Gnedin et al [TB]. 
Theorem 1.2. If A satisfies 

(5) 

then ;i(A,")T(A,n) (^) Exp{l). 

Remark 1.2. Condition ([5]) implies that A({0}) = 0. Indeed, if A({0}) > 0, then gl!"'' > (2)A({0}) 
and < 7iA((0, 1]). Then © is invahd. 

Examples: We give a short list of typical examples satisfying condition ([5]) which are processes 
without proper frequencies or look like "around" Bolthausen-Szitman coalescent. 



yn 



lim , . , 



4 



LINGLONG YUAN 



1 _ gi^'^ 

Ex 1: X- ^A(da:;) < +oo: It sufBccs to prove that lim = 0. Recalling the expression ([3]) 

of 5r/^', we have, for n > 2, 

n n 

_ /i/„(l - (1 - - "^(1 - xr-^)x-'Aidx) - (1 - x)" - nxjl - xr-^)x-^Aidx) 

n n 



The second term Jo ""^'A(dx) ^ J^/'' nA{dx) < J^^'' x-^A{dx) 0. For the first term, let e > 
and M = 1/e , then 

4^x-^A(dx) _ /;,^„x"^A(dx) ^ J^';i"x-^A{dx) 
n n n 

f!,, x~^A(dx) fM/n 

J'J Jl/n 

<e x^^A{dx) + / x^^A{dx). 

Jo Jl/n 

Notice that e x^^A{dx) can be arbitrarily small and /j^/" x^^A{dx) tends to as n tends to +c>o. 

Then we get that -^-^ — tends to 0. Hence if x~^A{dx) < +oo is satisfied, we get condition 

®. 

Ex 2: Bolthausen-Sznitman coalescent: In this case, it is straightforward to prove that gi^"* ~ n — l 

(A) ^ _ 

and ^(A'") = Inn, then lim — ^ — r- — lim — = 0. 

Ex 3: A has a density function /a on [0, r) where < r < 1 and there exists a positive number 
M such that /a < AI on [0,r): This kind of processes can be considered as being dominated by the 
Bolthausen-Sznitman coalescent. 

If Jq x~^A(dx) < +00, we turn back to the first example. If Jq x^^A{dx) = +00, then we have 

gi'^' < 2M(n — 1) for n large enough, hence lim sup — r < lim — -^yr — r-^ = 0. It turns out 

that this kind of coalescents also satisfies condition ((S]). 

Ex 4: A has a density function ff^{x) = p(lni)'' on [0, r) where < r < 1 and p, g are positive 
numbers: Using ([6]), we have 

^ I^/u--'Hdx) ^ Sl'-n-A[dx) 



For two real sequences (a;„)„>i, {yii)n>i, we write Xn x Un, if there exist two positive constants c, C 
such that cyn < Xn < Cyn for n large enough. Then it is not difiicult to find out that x (In n)'^"'"^ , 

1 iM gi"^^ 1 

f, , x^'^A(dx) x nflnri)'', L n'^A(dx) x nfln?!)'^. Hence we get — ^, — 7 x !► 0. 

Jl/n y / y ) ^ Jo \ ; \ / b ^^{Ku) i^n 

Theorem 1.3. If A satisfies condition 1^ and ^^x~^A{dx) = +00, then we have: 
(7) /x^^'") (Tf , , . . . , T(^,") , 0, 0, • • • ) ^ (ei , 62 , • • • ) , 
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where (ei)^^^ are independently distributed as Exp{l). 

Remark 1.3. The same resuh has been proved for Bohhausen-Sznitman coalescent in [9]. The au- 
thors have used a moment method. We can apply this theorem to Example 4 and Example 3 with 
/p x~^A{dx) = +0O. 

The following three corollaries have also been proved for Bolthausen-Sznitman coalescent (see [3], 

Corollary 1.4. If A satisfies condition then for any r G R+, 

lim E[(^^'^'")t(^'"))''] ^E[e{], 

where ei is distributed as Exp{l). Moreover, if x''^A{dx) = +oo, then for any k E N and any 
(ri,r2, ■ • • ,rfc) G (M+)'', wc have: 



i=l 



i=l 



where {ei)i<i<k are independently distributed as Exp{l). 



Corollary 1.5. If A satisfies condition and /g^ x ^A{dx) = +oo, then the total external branch 
length L^J^'^^^ satisfies: /^^^'"'ig^j"^//! converges in probability to 1. 

Corollary 1.6. // A satisfies condition ^ and x^^A(dx) — +oo, then the total branch length 
2^(A,n) satisfies: converges in probability to 1. 

Remark 1.4. In fact, we wiU prove that lim E[/./'^'"'i''^'"V?^] = 1- Notice that Corollarv ll. 41 gives 

n— v+oo 

lim E[^(^'")i^;^f V^] = 1- Hence we deduce this corollary using Corollarv 11.51 

1.2. Organization. In section 2, we introduce the main object of this paper: the measure division 
construction. In this section, we define the restriction by the smallest element which serves as a 
preliminary step to measure division construction. In section 3, we use the established construction 
method to study the MRCA of a Boltausen-Sznitman process perturbed by a signed measure. In 
section 4, we study the lengths of branchs of those processes satisfying the condition ([5]). 



{3}«- 



{4y- 



{2,3,4} 



{1,2,3,4,5} 



{1,2}«- 



■[3,4,5} {1,2,3,4,5} 
{3,5}« 



{4} 



(a) nt'^'S) 



(b) A restriction by the smallest element of n''^'^' from 
{{I},.- - ,{5}} to {{1,2}, {3, 5}, {4}} 



Figure 1. Restriction by the smallest element 
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2. Measure division construction 

2.1. Restriction by the smallest element. Let ^„ = {Ai,--- Xn = {Bi,--- be 
two partitions of {1, 2, • • • , n}. We define sf (resp. sf) as the smallest number in the block Ai (resp. 
Bi). We define also the notation ^„ ^ x„, if |xri| < iCnl and for any 1 < i < \xn\,Bi = IJjg/i 
where {Ii}i<i<\x„\ ^ partition of {1, 2, • ■ • , Roughly speaking, ^„ is finer than Xn- 

If Xn, we define the stochastic process fl'--^^Xn} ^ called the restriction by the smallest element 
of n^A'''"^ from ^„ to Xn- 

• n(A>X")(o) = x„; 

• For any t > 0, if n(A'«")(t) = {A} i<i<|n(A.5,i)|(i), where Di denotes a block, then 

n(A*0(i)^{ (J i?,}i<,<|n(A,5„,|(,), 

G-D, 

where the empty sets in (t) are removed. 

Notice that the restriction by the smallest clement is also defined from path to path (see Figure [1]). 

Lemma 2.1. fK^.x.) has the same distribution as H^A.Xn)^ 

Proof. Every block in Xn is identified by its smallest element which belongs to a unique block in 
Hence for any Bi in Xn, there exists a unique Ar- such that Ar- & ^n, Ar- C Bi and Sr. = sf with 
G {1, 2, • • ■ , |^n|}- Let x'n = {^ri}i<i<|x„| ^ud define a new process Il(A.x„) as follows: 

• For any t > 0, if H(A^«")(t) = {D,} 

i<i<|n(A.5„)|(t), then 
m^'J{t) = { U A.Ji<,<|n(A.5.)i(t), 

s^. G-D, 

3 

where the empty sets in Il^^'^"\t) are removed. 
It is easy to see that II(A,x„) jg a restriction of H^^^?") from ^„ to x'n- By the consistency property, 
we get n(A,x,i) U n(A.x„)^ Because of the construction of n(A,x„) and n(A,x™)^ what is determinant 
is the smallest clement in each block. Hence to obtain H'^^Xti) from n(A.x„)^ at time 0, one needs to 
complete every Ar- by some other numbers larger than to get Bi and then follow the evolution 
of n(A,x;). It turns out that H^^.x.) 

is a coalescent process with initial value Xn- Hence we can 
conclude. □ 

2.2. Measure division construction. Let A, Ai, A2 be three measures such that A = Ai + A2 and 
A{dx) < +00. 

We denote by H^'^j"' •= (ni'^'"'' (0)t>o the stochastic process constructed by the measure division 
construction using Ai and A2. Here the index (1, 2) is for A = Ai + A2 with Ai called "noise measure" 
and A2 " main measure" . 

• Step 0: Given a realization or a path of H^Al"), we set n[^i"\t) = H(Ai'")(t), for any t > 0. 
We set also io = 0. 

• Step 1: Let ti,t2, • • • be the coalescent times after to of the given path of H^'^/'"' (if there is 
no collision after we set ti = +00, i > 1). Within [to,^i), H^^j"^ is constant. Then we run 

an independent A2 coalescent with initial value H^^2"''(^o) from time tg. 

— If the A2 coalescent has no collision on [to, we pass to [ti, t2). Similarly, we construct 
another independent A2 coalescent with initial value H^^2"^(ii) from time ti, and so on. 

— Otherwise, we go to the next step. 

• Step 2: If finally within [ti-i,ti), the related independent A2 coalescent has its first collision 
at time t^ and its value at <* is ^. We set the new (H^^'"^ (t))t>t^ as the restriction by the 
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smallest element of previous {Hi 2" (i))t>t. from previous Hi 2"^ {t^.) to ^. Then we go to step 
1 taking as the starting time to- Notice that by this restriction by the smallest element at 
time t*, conditioning on {H^i^2"^\t*) = C}; 0^i^2^\'t))t>t, has the same distribution as a Ai 
coalescent from time t* with initial value ^, due to Lemma l2. II 

Remark 2.1. • The measure division construction works path by path and is constructed 

by following the time. We also find out that this construction is based on n'^^'"' which is 
random. Hence this construction is established in a random environment. 

• If we take Ai = as noise measure and A2 = A as main measure, then n''^i'"^(t) = 

{{!}, {2}, • • • , {n}} for any t > and n[^''^ nf^'"). 
Theorem 2.2. Let A, Ai and A2 be three measures and A = Ai + A2. Then we have 

Proof. Let t be a coalescent time of n^^2"'' • We consider the time of the next coalescence and the value 
at that moment. In the measure division construction of 112^2 "\ can see appearing two independent 
processes with one a Ai coalescent with initial value II^^j"'' (i) ^'Hd the other one a A2 coalescent with 

initial value II^^j"^ (t) from time t. The process II^^j"'* g^ts the next coalescence whenever one of them 
firstly encounters a coalescence and picks up the value of the process at that moment. Then we follow 
the same procedure from the new coalescent time of 'n\^2^'^ . It is easy to see that IlJ^j"^ behaves in 
the same way as 

n(A,n). Hence we can conclude. □ 

Remark 2.2. The theorem shows that if we exchange the noise measure and the main measure, the 
distribution of the process is not changed and is uniquely determined by their sum. 

Remark 2.3. The measure division construction also works for more than two measures. If there 
are fc(fc > 2) measures {Ai}i<i<fc and A = X^iLi one can get a stochastic process by first giving 
a realization of H^^^'"^ which will be modified by A2 in the way described in the measure division 
construction, and then we apply A3 on the modified process, etc. The equivalence in distribution can 
be obtained in a recursive way. 

We give two corollaries to show some properties of the measure division construction. 

Corollary 2.3. Let 111,112 be two given coalescent paths of respectively two whatever coalescent pro- 
cesses , and jllil < |n2| < +00 at any time. We apply a finite measure A to them using measure 
division construction. Then the time back to MCRA related to Hi is stochastically dominated by that 

0fU2. 

Proof At first, we rename the blocks of ni(0) and of n2(0) as that ni(0) = {{!}, {2}, ■ • • , {|ni|(0)}} 
and 112(0) = {{1},{2},-- - ,{|n2|(0)}}. Then Hi (resp. 112) can be regarded as a coalescent path 
descending from {{!}, {2}, ■ • • , {|ni|(0)}} (resp. {{!}, {2}, • • ■ , {|n2|(0)}}). We wiU modify Hi (resp. 
112) to H'l (resp. 112) using the finite measure A and the measure division construction. At first we 
set n; = Hi (resp. = II2). Let si = miii{t\Il[{t-) ^ n'i(t), or n'2(t-) ^ n^(t)}. Then we run 
an independent A coalescent with initial value 112 (0) (renamed) . 

• Step 1: If the A coalescent has its first coalescence within [0, si), we apply it to IIj in the 
same way as described in the measure division construction and also to n'^ using restriction. 
Hence we get the new H'^ and H2 and we do the same thing taking si as a starting point. 

• Step 2: Otherwise, we pass to [si,S2), where S2 = min{t > si|Hj(t— ) ^ Hi(t), or n2(i— ) ^ 
\^2^y\- We construct another independent A coalescent with initial value H2(si) and go to 
step 1. 

Notice that the modifications are coupled and if we consider single Hi or H2, due to consistency 
property, each path is modified following the same procedure as described in the measure division 
construction (especially for Hi). It is evident to see that the coupling let Hj arrive to its MRCA 
earlier than Hj to its own MRCA. Then we can conclude. □ 
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We denote by the stochastic domination between two real random variables. The following 
corollary is the same as Lemma 3.2 in [2]. But we prove it again in our way. 

Corollary 2.4. Let Ai, A2 be two finite measures such that Ai < A2, then we have r^"^^'"^ ^ T-(Ai,n)_ 

Proof. The variable t^^^'"''> can be viewed as the time back to the MRCA of the measure division 
constructed process with the noise measure Ai and the main measure null. The variable 
be viewed as that of the process with noise measure Ai and main measure A2 — Ai. Since the measure 
division construction can be established from path to path for both using the same realization of 
so we can conclude. The proof can also be done using Corollarv l2.3l □ 



2.3. A tripling. We often have some results on the coalcscent related to a special measure, for 
example, the Beta coalescent. When the process is perturbed by a noise measure, we would wonder 
whether this damage is negligible. One example is to estimate the number of blocks of the coalescent 
related to the noise measure after a certain time (if this noise measure is a signed measure, one can 
study its variation). To this aim, we use the tool of tripling. 

Tripling: Notice that n'^^^") encounters its first collision after time e^"'', which is a random variable. 
At this collision, the number of blocks is reduced to n — VF^"'' , where Wi"^ is random. Then we add 
new blocks (these blocks can contain any numbers belonging to {n + 1, n + 2, • • • }) and consider 

(n) 

the whole new n ones. By the consistency property, the evolution of the original n — Wi blocks 
can be embedded into that of the new n blocks, i.e. after time ej"^ we have the collision in the 
new n blocks whose total number is reduced to n — W^2"-' and we can calculate the distribution of 
the number of blocks coalesced among the original n — Wi^^ blocks (we call any block containing 
at least one of {1, 2, • • ■ , 71} as "original block" and it is very possible that nothing happens for the 
n — Wi"'' blocks). Then we add again new blocks containing different elements to have another n 
ones. This procedure is stopped when every element of {1, 2, • • • , n} is contained in one block. By 
the definition of A coalescent, (e|"^)i>i are independent exponential random variables with parameter 
g'i^^ and (W^/"^)i>i are i.i.d copies of x[^'"\ 

The above procedure gives a tripling of (e|"^),>i, (Wi^"^),;>i and B^^^"). We define k/"^ := 
J2]=i i G N. Then we have the following proposition: 

Proposition 2.5. Suppose that (e|"^)i>i, (Wj^"^)i>i and 

n(A,„) 

are tripled, then at any time t > 0, 

we have 

N{A,n,t) 

(8) E W^i"^ < |B(^^")|(t), 

1=0 

where N(A,n,t) :— cardlilV^^""^ < t}, which is Poisson distributed with parameter gif^^t and inde- 
pendent of {wj^'^^)i>i. Meanwhile, 

(9) E[W^/"'] ^ ^ -j- — '- ^ - 1, and E[{wj;"' f] = — ' -j^ ^ -E[W^^"Y 

Qn gk 

Proof. The number of is within [0, t] follows the Poisson distribution with parameter gl!^^t. Due to the 
tripling, at any time T^*-"-* with < y/"'' < t, the decrease of number of blocks (i.e. jB^"^'"^ |(y/"'' — ) — 

|n(A,«)|(y^("))) 

among original blocks is less than or equal to W^'"^ Bence we get ([5|). Notice that 
yy_(ri) (rf) jj^(A,n)^ i\\en ([2]) is a consequence of two equalities in [S] with Eq (17) for the first one and 
p. 1007 for the second one. □ 
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3. Application 1 : Theoren fTll 

3.1. Annihilator process. In Theorem 1 1.11 the main measure A''^'' is perturbed by a signed measure 
whose variation is related to a coalescent process without proper frequencies. To estimate the damage 
made by the noise measure, we first recall the definition of annihilator processes. 

Let A be a finite measure on (0, 1] and /o x-^A{dx) < +00. The annihilator process related to 
A, denoted by was introduced in [IT]. It is a cadlag continuous time Markov process taking 

sets of blocks which are subsets of {1, 2, • • • , n} as values. At time 0, II^^'") (0) = {{1}, {2}, • ■ • , {n}}. 
As time goes by, wc remove several blocks uniformly until its value equals empty. More precisely, let 
|n(^'")| be the block counting process of H^^-") and if at time t, |n(^'")|(t) = b with 6 > 1, then it 
encounters the next remove after an exponential time with parameter: 

and the probability to remove a group of fc(l < k < b) blocks with the others unchanged is: 

9 b 

There are two constructions given in |17j . Both of them concern the coupling between the coalescent 
and annihilator process. 

First construction: The general case with x~^A{dx) < +oo. This construction is mainly due 
to the remark in section 3 of |17j . Let {St)t>o be a subordinator with the Laplace transform 

(10) E[e-^^~*] = e-**(^' , t > 0, A > 0, 

where the Laplace exponent 'i'(A) = /q^(1 — (1 — x)''^)x~'^A{dx). Define St := e~^K Then St is a 
non-increasing non-negative process with value 1 at time 0. Let (wi)i<i<„ be n independent uniform 

random variables on [0,1], then (fl'^'"^ (t))f>o {i\ui < 5f, 1 < i < n}. Due to a coupling not 
recalled here between Il(^'") and n''^'"^ we can find a probability space where we have for any i > 0, 

(11) |n(^'")|(t) < |n(^-")|(t). 

Second construction: The case with x~^A{dx) < +oo. This one is due to section 2 of [T7] . 
Here we recall the precise coupling. Let (?7|^'')i>i be independent random variables following the 
distribution of rf -^^l!^] > and {e'^^)i>i i.i.d copies of Exp{(} x~'^ A{dx)). At time 0, we have n 

Jo ^ A[^ax} 

singletons from 1 to n which are called primary. At random time e^^"*, conditioning on ri'-^\ every 
block is independently marked "Head" with probability TyJ^-* and "Tail" with probability 1 — 'n'l^^ . 
Those blocks marked " Head" are merged into a bigger block called secondary, provided that there are 
at least two "Head"s. If there is only one "Head", wc call this block secondary and do nothing else. 
Then after another time e^2^\ we do the same thing, and so on. This construction gives a coupling 
between the whole block process and secondary block process, with the former being H^^'") and the 
latter n(A."). it is evident to see that, in this construction, |n(^'")|(i) < |H(^^")|(t), for any t > 0. 

For i > 1, we call X)}=i ^j^^ ^^le i-th marking time and (L^^'"'*\j > 1) the i-th selection time of 
{j} when {j} is marked "Head" for the i-th time (we set ij^'"'*' — -foo, if {j} has no i-th selection 
time). Notice that (ij'^'"'"'^'')i<j<n are exchangeable. It is easy to see that i''^'"'-'^' is exponentially 
distributed with parameter x^^A{dx). It is shown by the construction that 

(12) ^(A,„,l) <y^(A.n)^ 

Remark 3.1. If x~^A{dx) < oo, wc define the probability p(^^") for singleton {1} of H^"^'") not to 
be coalesced at its 1-st selection time and define also (A-'^').j>i := i'r]i^^^]=\{^ — ''l\^''))i>i- Notice 
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that the 1-st selection time of singleton {1} is an exponential time with parameter x~^A{dx). To 
see whether singleton {1} is coalesced or not at its 1-st selection time depends on whether there exists 
at least another block marked "Head" at the same time or not. The construction shows that at the 
1-st selection time of singleton {1}, there are more blocks waiting to be marked in coalescent case 
than in annihilator case. Hence is less than or equal to the probability for singleton {1} of 

n(A,n) coalesced at its 1-st selection time which equals exactly J2t^ ~ ^i^'')""^]- 

In short, P(^'") < EtTE[A|^^(l - a'^^^)"-^]. 

Lemma 3.1. Let H^^'"' be a coalescent with — ln(l — x)x~^A{dx) < +oo. Then for any r > 0, 
there exists m > 0, such that 

lim P(|H^^^")|((lnlnn)'') > ne-'"^^"'"")'^) = 1. 

n— f +00 

Proof. Notice that A satisfies x~-^A{dx) < +oo. Hence we can use the first construction above 
where we pick up the notations. The dominance gives that it suffices to prove 

lim P(|n(^^")|((lnlnn)'') > ne-"('"'"")'^) = I. 

n— ^+oo 



A derivation of pU| on A gives that 

E[Ste-^^'] = e"**'^) / -t{l - x)^ ln(l - x)x-^A{d: 
Jo 



Then let A tend to to get E[5t] = —t ln(I — x)a; ^A((ia;) < +00. Hence, it follows that ^ converges 

S(lninnV 

(In In n)^ 



~ S r 

in probability to K[Si] due to the law of large number. In particular, we have "-.'^ converging in 



probability to E[S'i] which implies that _E[g'"]('i'n"i!i,i)'- converges in probability to 1. 

Let Xi := lu,<s^u,l„„)rA < i < n. Hence given p 5(ini„„)r, (Xj)i<i<„ are n independent 
Bernoulli variables with parameter p. We get, due to the subordinator based construction, 

n 

(13) |n(^'")|((lnln7i)'-). 

i=l 

Chernoff's inequality says that for any < e < p, with p given, 

n 

where D{x\\y) = x In | + (1 - x) In i^, < a; < y < 1 (at the boundary, we set D(0||0) = ^(llll) = 
and let D{x\\y) extend continuously to other points). We choose e = p — p^, then D{p — e\\p) = 
D(p-^\\p)=pHnp+il-p^)\n^>p^lnp+{l-p^){-p-'+p-3^)=p{l-p-'+plnp)~^il-p''). 
Hence conditioning on p, we get 



(14) 



' <p^\p) < e-(J'(i-p'+pi"PlP)-T-(i-p'))", 



where the right-hand term converges in probability to 0, since _E[Sii(ininn)'- — -Eisiuininn)'- 
verges in probability to 1. Then for m > 2E[S'i], 

lim P( ^'=i ' > e-^aninn)--)) ^ jj^^ ^ _ E[F{ ^'=^ ' < p'^Ip)] = 1, 

n— i.+oo n n— > + oo n 

We conclude with (flSl). □ 
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3.2. Proof of Theoremimj Let Ti ^ /i"(r), = M+(r). then A'^^^'-Fi < A = A''=''+r < A''=''+r2. 
Define H^^'"' (resp. 112'^'"'') as the coalescent process related to A''^'' - Ti (resp. Is}"-^ + V2) with the 
time back to the MRCA r^"^'"^ (resp. r}"^'"^). Thanks to CoroharylHl 

(15) rf^") «r(^'") «r{^^"). 
Then it suffices to prove that 

(16) r<^'"^ - In Inn ^ G, and rf '"^ - In Inn G. 

This resuh was already proved in the case of Boltausen-Sznitman coalescent (or A'*^^ coalescent) 
for which we denote by r^*'"' the time back to the MRCA. Comparing the measures, we also have 

Let r^*'") be obtained from the measure division constructed n^^2 with noise measure Fi and 
main measure k^"* - Li. We define the event E := {|n(^i'"'|((lnlnri)2) > ^e-^iC"^'"")'}, mi > 0. 
Then thanks to Lemma I5t1 with some mi, we have 

(17) lim P(£;) = 1. 

n— f +00 

This result implies that the probability for min{|n('"i'")|(i)o<t<(ininn)2} to be larger than or equal 
to 7T,e~'"i(i"i"") tends to 1. The measure division construction of A''^'' coalescent is to apply measure 
l^eh _ to a given path of n^^^'"). While for n^^'"\ one needs to apply hl-^^ — Fi to a constant path 
of n singletons (see the second point in Remark 1 2. ip . Thanks to Corollarv 12.31 we fix x > 0, and n 
large enough such that Inlnn + x < (Inlnn)^, then we get 

P(t(*'"' < In In n + x) < F{t[^'"^ < In In n + x) 

<p(^(*,|n(^i-)|((inin„)^)) <inlnn + x|£;) < p(r(*'L--'"i<'"-"''J' <lnlnn + a;), 

where the notation [sj := max{m|m < s,m E Z}, s £ R. We define also \s~\ :~ min{m|m > s, m G 
Z},s e R. Notice that 

lim P(r(*'"' < Inlnn + x) =IP(G < a;). 

Hence due to (ITTI) and lnln([rie~™i^''^'""^ J) — In Inn —> 0, we also have 

lim P fr(*'L"'=""'""'°"'''J) < Inlnn + x\e) = P(G < x). 

Then we can conclude for the first convergence in (|16p . The second convergence in (jl6p can be deduced 
in the same way. Hence together with (jlSp . we prove Theorem ll.il 



4. Application 2: under condition ([5|). 

Some notations for this section: Let A be a finite measure on [0, 1] and Ai = Al[o.i/„), A2 = Al[i/„ 1] 
. ^(A.i/y) ^ x-^k{dx), .gj/J = /o (1 - (1 - xf/y - ^x{l - xY/y-^)x-'^K{dx) with < 2/ < 1. Notice 

that the definitions of and gj^^ are consistent with that of ^t'^'") and glf^ when A({0}) — 0. 

These notations help to examine carefully different measures. One thing to notice is that Ai and A2 
both depend on n. 

Here we are going to prove Theorem 11.21 Theorem 11.31 Corollary 11.51 and Corollary 11.61 Under 
condition we decompose A into A2 and Ai. The idea is to construct H^^'") using measure division 
construction with noise measure Ai and main measure A2. At first, we need to show more details 
implied by condition 
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Proposition 4.1. The following two assertions are equivalent: 
(*): A satisfies condition ^Bj); 

(**): A({0}) = and there exists a caglad (limit from right, continuous from left) function f 
[0, 1] — !• [0, 1], continuous on with /(O) — and a constant Ci > 0, such that 

(18) /x^^^i/^) = C^expi C ^dx){l - f{y)), 0<y<l. 



Proof. • Wc first assume that (*) is true. If A satisfies ([5)), tlren A({0}) — due to Remark ll .2 

For /i^"^-") 0, we liave 



9n 



(A) 



/„^(1 - (1 - x.r ~ nxjl - xr-')x-^A{dx) _ („) („) 



where /^"^ = /(") ^ Jcy"(i-(i-x)"-«:r(i-x)"^i)x^^A(dx) 

/,\ a:~^A(tia;) (•„) /.^ 2:~^A(da;) 

Notice that for n large, using monotone property, we have — „^(a ,») < < — ") 

and I " '^"^(A^)'^'^^ ^ ^2"'' — "•^"^(A,^)'^'^'' ■ Hence condition ([S]) is equivalent to 

f^, x-'^Mdx) n f^/" kidx) 

(19) lim =0,and lim „P = 0, A({0}) = 0. 



Then wc deduce that 
Indeed, for 1/y > 2 and /i(Li/!/l) ^ 0, we have 



(20) 1™ °rAi/„ -0,A({0})-0. 



/o"A('^^) ^ /o"A(rf^) < /„^/Li/.J A(rf^) ^ Ll/yJ/„^/Li/.J ^^^^^ ^^^^ ^ 

y/,(A,i/.) yjJ^-iA(dx) - ^/,^/Li/.jZ-iA(.) LI/2/J /,^/Li/.J-"'A(x) 

One thing to notice is that lim y/i'-^'^^*''' = is true for any finite A. In fact, for any positive 

y-^0+ 

number M and yM < 1, wc have 
y^(A,i/!y) x-^A{dx) ^y x-^K{dx) + y x-^A{dx) < ^ ' + / A(dx), 

Jy JyM Jy M Jy 

where both terms are as small as we want by taking M large enough and y close enough to 
0. Looking into details of ^° (A.^i'/y) when ^(A,i/y) _^ have the following equality, using 

integration by parts and lim y/i*^^'^/^^ = 0, 

j/->0+ 

j^K{dx) _ j^xx-^Kidx) _ fi^^^^^^Ux-yfi^^^^M 



Together with (pil)) . we have 



hm -Tj; — , . , , , — = 1. 



Notice that ^'^^'^^^''dx > y^''^'^/^^ and ^(^-^/y) is a caglad function. Hence there exists a 
caglad function / : [0, 1] [0, 1], continuous on with /(O) = such that 

„i,(A,i/a) 
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Notice that the fundamental theorem of Newton and Leibniz works for caglad functions whose 
primitive functions take left derivatives. Then we can conclude with (j21[) . 
• We now assume that (**) is true. In the first part, we proved implicitly that (|20|) is equivalent 
to the right-hand condition of this proposition. Hence we will use ((20|) to prove (|19p which is 
equivalent to condition ([5|) and only the first convergence in ()19|) is needed to be proved. Let 
M be a positive number and ^ < 1, jjS^'^^ ^ 0, then 



^^(A,n) 

I „{A,«/M) 
< 1- 1 - 

- M ■ 



„(A,n/A7) 

The first term is as small as we want by taking M large, and the third term ^(A,n) 

rM/n /(x) , X W(M/«) 



exp{~ j^^l^ '^^) T^fiT/TT) ■ ^'^^ e > and n large enough such that f{x) < e on Al/n]. 

(A.ii/M) 

Then > exp{-( 

Hence we can conclude. 



Then ^ > ea;p(-elnM)(l - e), which is as close as possible to 1 with e small enough 



□ 



The next corollary is immediate. 
Corollary 4.2. If A satisfies then 

( {A,n)\k 

, lim ^ L=o,Vfc>0; 

n^+oo n 

(A,n) 

• lim -f. 77T = 1,VA/ > 0; 

(A,n) 

• lim — . = l,VO<e<l. 

We should next estimate the coalescent process related to the noise measure Ai which serves as a 
perturbation to the main measure A2. At first, one needs a technical result. 

Lemma 4.3. We assume that A({0}) = 0. Let g\t^^ = (1 - [l - x)"^ - nx[l - xY-'^)x-'^ Ai{dx) in 
the spirit of (0). Then there exists a positive constant C2 such that for n large enough 

(22) glt'^>C2n'l A,{dx). 

Proof. Let A/ > 1. We write 

r-l 







(1 - (1 - x)" - nx{l - xT-')x-^Ai{dx) 

(1 - (1 - xT - nx{l - xT-^)x''^ Ai{dx) 
h+h, 
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where h = /n^(l - (1 - a;)" - nx{l - xy''''^)x-^Ai{dx) and I2 = /^(l - (1 - a;)" - nx{l 

nM 

x)^~'^)x~'^ Ai{dx) . It is easy to see that for n large, 

/•diT 1 
Ii> {n{n-l)-n{n-l){n-2)x)-Ai{dx) 
Jo 2 

> / {n{n-l)-{n-l){n-2)/M)-Ai{dx) 

.In 2 



I rTTKT 

> - r?Ai{dx) 
4 Jo 



For the seeond term, 

Notice that for n large, there exists a positive constant C{M) such that 

Hence I2 > C{M) f " n^Ai((ia;). It suffices to take C2 = min {i,C( A/)} to conclude. □ 

nM 

The following lemma estimates the coalescent process related to the noise measure Ai when A 
satisfies 

Lemma 4.4. Let A satisfy Then for any M > 0, < e < 1 and n large enough, we have 
(23) P (^|n(^-")|(-^) <n^ne^= o{n-'). 

Proof. If Jq^"" A{dx) ~ with some uq > 1, then for any n > no, Ai = and hence |n''^i'"''|(t) = n 
for any t > 0, which proves this lemma. In consequence, one needs only to consider the case where 
/p ^" A{dx) ^ for any n > 1. 

We recall gi^^^ defined in Lemma [4.3l Thanks to Proposition l2.5l where we pick up the notations, n— 
^jv(Ai,n,^(^ ) ^ |n(Ai,»)|(_M^)^ ^herc iV(Ai,n, -^j^) is Poisson distributed with parameter 

^^(k,n) independent of {W^"'^)i>l which are i.i.d copies of xj"^^'""*. Then we have, for n large. 



M 



i>(|n(Ai'")|(-i^) < n-ne) < P(?i- ^ W^^"^ < n - ne) 

i=l 



i=l 



{ne - 4^E[<'])2 (n. - 4^E[I^("'])2 



where the second inequality needs ne — ^"^^f^^^ W\W[^^] > which is justified by the following calculus: 
Notice that due to Proposition 12.51 and Lemma [4.31 for n large enough, 

where C2 is the positive constant in Lemma 14.31 
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Notice that ([5]) gives ^^'^^,1) < ^^^"a_„) — >■ 0. Then together with (^5]) . we have 

Hence ne - ^^J^E[PF|"^] x ne and ([Ml) gives 

Then we conclude for (l23l). □ 



The next Lemma helps to explore a property of the main measure A2 when A satisfies ([5]). Notice 
that Jp x~^A2 < +00, hence we can use the second coupling construction in subsection 3.1 for n(^2,n) 
and n('^2,n)^ T^^Q pjgjj lYiQ notation p(^2,n) related to A2 for the probability that singleton {1} of 
jj(A2,n) j^qI; ]-,g coalesced at its 1-st selection time, as p(^'"') related to A in Remark 1 3. II 

Lemma 4.5. Let A satisfies (0). Then 

(26) lim p(A2,«) = 0. 

n— >+oo 

Proof. Let (?y.|'^^'')i>i, (ej-^^^)i>i be associated to A2 as defined in the second construction in subsection 
3.1, and pick up the definition (A,|^^'')i>i in Remark |3. II which shows that 



i=l 

It is easy to see that E[A|'^^^(1-A('^^^)"-1] = E[A('^^'(1-A,|'^^^)"-1], where A^^^^^ = rji^'^Il^^^il- 

3 

Ts := minlzlA^"^^' < 1/s} 



It is obvious that (A|^^'')i>i is a Markov chain. For s > 0, we define a stopping time 



= min{i| -^H^ - Vj^^^) > InsTyJ^'^} 



i=2 



min{^ + 1| - ^ ln(l - Vj+i) > \nsTj[^''>}. 



Then we get 



i=l 

(27) =nY. a|^^'(i-a|^^v-i + ^ A|^^'(i-A|^^^)"-i]. 



Notice that x{l - .t)"-i < ^ , if i < x < 1 and a;(l - a-)"-^ < x, if < a; < Then ([271) gives 



+00 

(28) p(A2,«) < + ^ Al'^^^^] < E[ 



r„-l, 1 



(A2) 



To calculate E[t„], we use the renewal theory. Let n = E[— ln(l — 'q"^'^^)]. 
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(1) If /i < +00. Wc denote by F{t) the distribution function of — ln(l — and X an indepen- 

dent random variable with density function ^(1 — F{t))lt>o. We define a new Markov chain 

- E]=2 - Vj^"^)h>i and = mm{i\X - ^j.^i ln(l - f]\l\^) > Ins)} for s > 0. It is 
clear from the definitions of and that for any e > 



Then 



E[t' (A2) |X = e] = E[r^ea;p(-e) - !]• 



> P(0 < X < e)E[r„,,p(_,) - 1] + E[r' lx>e], 



which implies that 



^Tnexpi-e)] < ^ 



<X <e) 

Due to (4.4) and (4.6) in [[M], p.369], we have 

EK] = — ,Vs > 1. 
Notice that 77^"^"' > i, hence njyj'^"' > 1. Therefore, ^ gives 

. ^ ^[^xKe),;-^-'] , ^ _ E[ln(nea:p(e)7y(^^))] 
^^^"J - P(0 < X < e) + /iP(0 < X < e) + 

Notice that for any < x < 1, we have — ln(l — a;) > a;, hence fi > E[?/[^^'] = fi'^^'"\ Then 
30| implies 

E[t„] ^ E[\nnT][^^'^]+e 1 
n ~ EHi'^'^]P(0 < X < e) 
Using it suffices to prove that: 

hm E[n?7^ = +00, and hm — — ^ — , = 0. 



n— >-+oo 



(A2)l 



It is easy to see that, using there exists a positive constant C3 such that £[^775^ 
"^-2A(dx) — ^3 (A) 1 for any n > 3. Hence E[n?7j ^] tends to +00 since A satisfies (|5|). 
For the second convergence, we fix M > e. Then, 

E[ln(n7,i^^')] _ K[ln(-^^')1„,<-^)>M] +E[ln(nr;(^^))l„^,A.,<,,] 



< 



< 



C[ln(7i77i^^')l^^(A,,>J 



lMnr^[^'''>)l ,A,)^J InM 



InM InAf 
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The last inequality is due to the fact that for any x > M > e, we have < ^^jf^- Since M 

, , , , ,. Eflnf™!^^^)! „ 

can be chosen as large as we want, then lim — - — ^ — , . , = 0. Hence we can conclude. 

(2) If ^ = +00. We define (^|'^''')i>2 := (51„(A2)>i + ?7|'^'''1„(A2) , i )i>2 and for s > 0, := 

'li —2 U ^ 2 

min{i + -ln(l " Vj^i) > hisjyj'^"^}. Notice that E[-ln(l - f][^^^)] < +oo, then we 

return to the first case and get (|3ip by replacing r„ by fn and keeping the same rj^^^'^ but 
with different X (depending on f]\^^\i > 2). We see that the closer 7?-^^^ is to 1, larger the 
— ln(l — fi[^^^) and hence r„ <C f„. Then we can conclude. 

□ 

The next corollary is straightforward. 

Corollary 4.6. Let A satisfies ^ and we define the probability p(^2,n,k) ^.^lated to n^^^-") for at 
least one among the first k singletons being not coalesced at their 1-st selection times. Then 

(32) p(A2,«,fc) < ^p(A2,n)^ 

and lim p(A2,",fc) ^ q. 

n— >+oo 

Before proving Theorem 11.21 we first give a trivial lemma whose proof is omitted. 

Lemma 4.7. Let T\ be a realization o/Il'^i'"' and t[^'^^^ be the external branch length o/{l} after 
the addition of A2 to li using measure division construction. Assume that for a time t > Q, there 
exists 2 < m < n, such that m < |n|(i) < n and {1} ^ n(t). Then after the adding 0/ A2 to 11 using 
measure division construction, for any < t' < t, we have 

Proof of Theorem 11.21 

Proof. Since the case x^^A{dx) < +00 has been proved in [23], we assume that x^^A{dx) = +00. 
The theorem is equivalent to say that 

(33) lim P(r(^^") > ^^) = exp(-i),Vt>0. 

Let T^^'""^ be obtained from the measure division constructed II^^j"'' with noise measure Aiand main 
measure A2. Let < e < 1. Notice that under the event {|n(^i'")|(-f^) > n— ?7e}, the probability for 
singleton {1} to be coalesced at time -jjij-^ is less than or equal to using exchangeability property 
(since there are at most [2ne] singletons merged). We denote by K„(t) :— P(|n(^i'")|( ^^^^^ ) < n — ne) 
and Kn{t) = o{n~^) due to inequality (fSSj) . Then for n large enough, we have f{E') > (1 — K„(i))(l — 
J-^^), where 

= {|n<^-')|(^) > n - ne] ^{t['^-^ > -^}. 

Lemma [4.71 gives that 

(34) P(Tf J) > > p(r(A^") > -±-^\E') > P(Tf > -1^). 

We prove at first lim p(p^^^'"-'> > _^ — — exp{~t). Recall that 7^^"^^'"'^^ is the 1-st selection 
time of {1}. The second construction in subsection 3.1 gives that 



(35) P(i(^^^"'i^ > + P(A^'") > p(Ti(A^^") > —L-^) > P(l(^^'"'1' 



> 
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It is revealed in the same construction that P(L^^^'"'^'' > -jji-^) — exp{~t). Hence using Lemma 
we get 

lim P(Tf > ^) = exp{-t). 
We obtain also lim PfT, ^ " ""^ > —r, — r) — exp(—t), using lim tt — ^ — = 1 which is due to 



Corollary |4?2 



Then dM]) gives that Um f(T^^'"^ > | \E') = e.T|3(-t). Since F{E') is as close as we want 



to 1 by taking e small and n large, we get lim p^y'"^'") > __ — _) — exp{—t), which allows to 
conclude. □ 

Proof of Theorem 11.31 
Proof. We prove instead for fc G N: 

(36) ^(A,„)(j.(A,„)^^(A,„)^... ^j.(A,n)^ ^ (ei,e2,--- ,efc), 

which is equivalent to (0 (see Billingsley [[3, P-19]). We will give the proof for k = 2 and leave the 
easy extension to readers. We have to prove: 

rp{A,n} ^ t-2 

n-s-+c30 ' " ~ ^(A,ri) '2 - ^( 

Similar to the event E' defined in the proof of Theorem 1 1.21 we define another event, with < e < 1 



(37) Jim_^P(T(^'"' > 7(^,Tf > ^) = exp{-h - h)yO < h < t. 



E" = {|n^'^-)|(^) > n-MHlT^-^ > > -^,}. 

Recall that {|n^'^^'"^|( (a,„) ) > n ~ ne} implies that there are at least ?? — \2ne] singletons at 
time . For n large enough, using exchangeability, we have P(£"') > -^^ — t|t — ^(1 — K„(t2)), where 

Kra(i2) = Pdn*^^^'"^ [( ^(i''^) ) < n — ne). For e small enough and n large enough, we have P{E") as 
close as we want to 1. 

^(A,n — ne) 

Similar to using the event i?" and also lim r-r — r — = 1, we need only to get the following 

convergence to prove ([57)1: 



Using the second construction in subsection 3.1, due to the same arguments to get ([35]), we have 

P(L(A2.«4) > 4A„„4) > ^.^) +p(A.,n,2) 

^ 1 - ^(A.n) ' 2 - ^(A,n) 

> p(j.(A..n) > tl iA,,n) > > p(Lf-"-l) > ^^^^.^n,!) > 

- ^ 1 - ^(A,n) ' 2 - ^(A,«) > - \ 1 - ^(A,n) ' 2 - ^(A,n) 

Using Corollarv 14.61 it suffices to obtain the following convergence to conclude: 

Indeed, adopting the notations in the second construction in subsection 3.1, within [0, there 
are Ni marking times and for {jjr-^, jjr^], there are N2 marking times. Ni and N2 are independently 

Poisson distributed with parameter respectively '^(^,,1) and jjk-^ • Then we get 
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-(A2.n,l) ^ j-{A2,n.l) ^ h 

^1 ^^(A,„)'^2 > ^(A,, 

T^i n A^2)\2rrNi+N2 /I ^(A2) 



exp 



where the last equaUty is due to the moment generating function of Poisson distribution. Recall that 



Then we conclude for (|39|) . □ 

Proof of Corollary 11.41 

Proof. We prove at first the case with one external branch length. Lemma 5.7 in [11] allows to 
just prove that for any fc e N, sup{E[(^('^^")Ti^'^'"')'=] |n > 2} < +00. Let M > 0,0 < e < 1, 
/3„ = |n'^^'")|( ^(y„) ) and uq := min{i|^(') > 0}. To avoid invalid calculus, we set ^(^'") = 1 if n < 
Using Markov property, we have 



no. 



where J"^^^'"^ T[^'"'\n > 2 and T^'^'"'^ is independent of everything. Then for ne>no, 
jE[(^(A,„)y(A,n)^fcj < e[(A/ + a.(^'")t('^'^")i^,,,„,^<.,„,^^^)''1 < (2M)'= +E[(2A.('^'")f(^'^")l^,,,„,^<.,„,^^^) 



{2Mr + exp(-ili|^)E[(2^(^.")T("))'= 



< , , ^^^^^^^ 



(40) 



(A.n) 

+ p(^(A,„)j,(A,„) ^ ^^)(2^^)'^-max{E[(M(^^^')Tf G 1]} 

+ P(/?„ < ne)E[^(2i;5^£))'(/^^^''"^^^'"')'l/3" < "^]' 



where ea;p(— ^{f"n) ) in the second term at right of the last inequality is the probability for no coales- 
cence within [0, The third term at right of the last inequality is due to the increasing property 
of ^(^'") on n when n> uq. The fourth term is due to exchangeability which says that the probability 
for {1} not to be coalesccnt at time -j^^-^ when there exist only /3„ blocks is less than One needs 

the following three estimations to prove the boundedness of (E[(/.i'^'^'"''T^^'^'"'')''])„>2. 
• Estimation of exp{~ ) : Notice that for n >no, 

gn""^ _ /o'(l - (1 - - "2^(1 - .T)"-i)x-2A(d.T) ^ - (1 ~ xr - nxjl - x)"'! A(da:) ^ e-2 
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And if 2 < n < uq, we have exp{— ^{a"„) ) = exp{—Mg'^^) q. Hence if M large, we 

have, for any n > 2, 

• Estimation of P(/i(^^")r|'^'"^ > M)(2-t^^)'= : Due to CorollarylOl we get Urn — . = 1, 
and Theorem [L2] gives Km P(^(^^")r;^'^'"' > M) = exp{-M). Hence by taking M large, we 

n— f +00 

have for any n > 2, 

,.(A,n) 1 

(42) P(^('^^")rf > M)(2^^)^ < -. 

• Estimation of ^(2 ^(A,g,i) < • Using the notations in Proposition |4?1] for /?„ > no, we 
have 

Let ni > no such that for any n > ni, we have /(1/n) < Hence for any a,b > ni, 
\Z^f\b] — 2- This ni can be found since /(1/n) tends to as n tends to +oo. Then (|43p 
implies, for /3„ > ni, 

<2(^)^. 



Hence if ni < /3„ < ne and e < 4^2fc-i^ 

^(2^)^- < 4^^)1/2 <4'^X^)V2<1. 
If /3„ < ni, due to Corollarv l4.2[ when n large enough, we have 

/Zlifoi^^ < - 

n ^ tii^M> - 2 

In total, when n is large enough and /?„ < ne, then 

Using (gni), dH]), dUl) and (011), we get 

E[(^('^^")ri^^'"V-] < -(27^)*^ + -max{E[(/i(^^^'Ti('^'^'^)'^-]|.7 e [ne,n - 1]} + -E[^('^^'3„) j.^(A,/3„)^fc|^^ ^ 
3 3 3 

(45) < -{2Mf + - max{E[(^i(^^^'fi('^'^'^)'^-]|j < n - 1}. 

3 3 

The above inequality is valid for a large M, e = 4-2*^-1 and n > ni. Let C4 > max{E[(^('^-')r;J'^"''^)'=], 4(2Af)'=|2 < 

j < m}, then for any n > 2, C4 > E[(^('^'")Ti^'^'"^)'=] using (|45|). Then we can conclude. 

The case of multiple external branch lengths is merely a consequence of the case with one external 
branch and also of Lemma 5.7 in [11) . Theorem II .31 and Cauchy-Schwarz inequality. □ 

Proof of Corollary 11.51 
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Proof. Corollary O shows that lim E[f/^''''> L^^l'"'' /n] = 1 and lim Var(^('^'")L|^^i"V") = 0> 



hence converges in to 1 which deduce the convergence in probability. □ 

Before proving Corollarv ll.61 we study at first a problem of sensibility of a recurrence satisfied by 
(T^'^'"')„>2. More precisely, if a„ = E[Ti^'^'"^], the n a„ satisfies a recurrence (see [S]): ai ~ 0, and for 
n > 2, we have 

, , fc — 1 
(46) an = c„ + } Pn.k flfc, 

k=l 

where (c„)„>2 = (-fiT)n>2 and p„ fc = pi^/!. Due to Theorem 11.41 we have lim ^^'^'"^a„ = 1. The 

question is as follows: what is the limit behavior of a„ if we set initially the values of (ai)i<i<„Q with 
no > 1 without using (j46p and replace c„ by = + o(-py)? It is answered in the next lemma. 

Lemma 4.8. Let {a[)i<i<no be Uq real numbers and for n > Uq 



ji-i 



fc - 1 , 



(47) a; = c; + Vp„,fc Cfc, 

— ' n 

k=l 

where {Cn)n>no T-s a sequence which satisfies c'„ ~ + o{-^). The 

lim H^^'"^a'„ = 1. 

n— f-f-oo 



Proof. We fix e > and let > no such that c'^ < ^^j^ for n > n^. We set M = max{|a^[, ai|l < i < 
nj. 

Let us firstly look at (|46p which has the following interpretation using random walk: A walker 
stands initially at point n, then after time c„, he jumps to point ki with probability Pn.kn then after 
time ^^^"'" Cfc^, he jumps to k2 with probability Pfci,fc2i and then after time ■^^^^—^^^^--^Cfej, he jumps 
to the next point, etc. If he falls at point 1, then this walk is finished. It is easy to see that a„ is the 
expectation of the total walking time. 

The recurrence (|T7)) has the same interpretation. The difference is that one should stop the walker 
when he arrives at a point i within [l,no] and one adds to the walk time (notice that a[ can be 
non-positive). To estimate a^, we use a Markov chain {Wi)i>o to describe the jumping structure of 
(gni) and dUl) : W^o = n, 

• If Wi = k with k > n^, then Wi^i = k' with probability Pk,k', where 1 < fc' < fc — 1; 

• If Wi < n^, then we set Wj = Wi for any j > i + I. 

Notice that the jumping dynamics of both recurrences is characterized by (Wi)i>o until arriving at a 
point within [l,ne]. And also we see that {Wi)i>o is the block counting process jn'"^'")! stopped when 
the first time arriving within [l,ng]. 

Let <j„ = min{i|Wi = Wi+i} , C^^ = ^i=o^ 'w^^ ^^^"^ -^"J" *° ^® time to <;„ of the random 

walk related to (06]) and T^^ being that of (gTl). 

It is easy to see the following relations: 

an - ME[C,J < E[r,J < a„; a'„ - M^C^J < E[T^J < a'^ + A/E[C^J; E[r^'J < (1 + e)E[r,J 
Notice that E[QJ < ^ and due to CoroUaryS^ we have lim — = 0. Hence lim ME[Q„]^(^'' 

^ n— )- + oo n n— >+oo ^ 

0. Then we can conclude that for n large, aj^ < (1 + 2e)a„. In the same way, we can prove also 
> (1 — 2e')a„ for another small positive number e' with n large enough. Hence we deduce the 
lemma. □ 

Proof of Corollary 11.61 
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Proof. Let bn — E[/i'^^'")L'^^'")/n]. Then looking at the first coalescence of the process 

n(A.„) 

we 

have, 

(48) b, = 0;6„ = L + > 2. 

9n k=l ^ 

If for some k, /x*^^'*^^ = 0, then we set /x*^^'*^^ = 1. To use Lemma l48t we write (|48]) as: 

(49) foi = 0;6„ = + ^p^^^——bk + 2^P»,fc (A.fc) fefc,^ > 2. 

5" fc=i ^ fc=i ^ 



We prove that ^;'^^p„,fc^i^j(^ = o(i^). Indeed, due to ©, let a = ^l{l-{l-xY-^)x-^k{dx) 
and M > 0, then 



n 



(50) P(X(^'") > Ma) < < . 

, , a f„^/"(7i-l)A(da;)+M^'^'"^ M^'^^"^ 

Using Corollarv I4.2[ we have limsup— < lim — = 0, lim — r-r -rj-^ 

„^+oo n ri-i-H-oo n ri-i.+oo ^(A/ii i\ia} 

1. Then for n large enough, 

^(A,n) ^["^/-^J ^(A,n) ^(A,n) 

Z^^^".fc (A,fc) - 2^ ^"^^^^(A^fc) + ^"^"n/.(A,fc) 
fc=l k=l fc=[n-A/aJ+l 

< P(xj'^^"^ > i\/a)E[ ^ \x[^'^^ > Ma] ^ 



(A,n-X<''-">) '1 - ^ ^(A,«-A/a), 



where the first term at right of the the last inequality is due to (jSOp and can be as small as we want 
w.r.t ^'(Ai' when M is large enough. Notice that n^^ = o( ^\a)' ) due to ([5]). Then the second term 

= "(^^ ^'I'^S „ll?oo ^(A,«-A/a) = 1- Then ^,^1 Pn,ki^^ = o(^)- 

In fact, it suffices to prove that {bk)k>2 are bounded, since in this case, pn^k ^^(a.^) ^fc = 

o( ^'^a"' ) and we apply Lemma l48l to (|49)). 

To control X^I—i P",fc nliii^l) we construct another recurrence: 

(51) b[ =0X = ^ + j:Pn^k ^'lX';"\ ln > 2. 

5" fe=i ^ 

where C is a positive number. If C = 1, this is exactly a transformation of the recurrence (j46p . Let 
Af'(C) = sup{feJJ. Then it is easy to see that M'{C) CM'{1). Let uq > 1, such that for n > hq, 

we have EZl PnM^^i^M'il) < i^^. Then for C > 2,n > no, 

{A.n) „(A,n) C/i(^'") 

(52) ^ + E^-^£(Ai) ^•^'(^) ^ 

9n fc=l '^'^ 5" 

For 2 < 71 < no, we set C large enough such that 
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(53) ^_^ + ^p„,^^niax{6,|l <i <no} < 

5" fe=l 9n 

Comparing the cocfScients and initial values of reeurrences ([^^ and (|5ip using ([5^ and (|53p. we 
deduce that 6n < < M'{C). Hence we can conclude. 

□ 
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